Processing Unknown Words in HPSG
Authors
Abstract
The lexical acquisition system presented in this paper incrementally updates linguistic properties of unknown words, which it infers from their surrounding context by parsing sentences with an HPSG grammar for German. We employ a gradual, information-based concept of "unknownness" that provides a uniform treatment for the whole range from completely known to maximally unknown lexical entries. "Unknown" information is viewed as revisable information, which is either generalizable or specializable. Updating takes place after parsing, and parsing itself requires only a modified lexical lookup. Revisable pieces of information are identified by grammar-specified declarations, which provide access paths into the parse feature structure. The updating mechanism revises the corresponding places in the lexical feature structures if and only if the context actually provides new information. Revising generalizable information requires type union. A worked-out example demonstrates the inferential capacity of our implemented system.
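The distinction between specializable and generalizable information can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that a type can be modelled as a set of atomic values, so that type unification becomes set intersection and type union becomes set union. The names `LexicalEntry`, `specialize`, `generalize`, and the feature paths `gender` and `case` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

# Assumption: a type is modelled as a frozenset of atomic values.
GENDER = frozenset({"masc", "fem", "neut"})


def specialize(old, ctx):
    """Narrow a value (unification modelled as set intersection)."""
    new = old & ctx
    if not new:
        raise ValueError("context is incompatible with the lexical entry")
    return new


def generalize(old, ctx):
    """Widen a value (type union modelled as set union)."""
    return old | ctx


@dataclass
class LexicalEntry:
    # Hypothetical stand-in for the grammar-specified declarations:
    # each dict maps an access path to its current revisable value.
    specializable: dict
    generalizable: dict

    def update(self, context):
        """Revise the entry after a parse; return True only if something changed."""
        changed = False
        for path, ctx in context.items():
            if path in self.specializable:
                table, revise = self.specializable, specialize
            elif path in self.generalizable:
                table, revise = self.generalizable, generalize
            else:
                continue  # not declared revisable, leave untouched
            new = revise(table[path], ctx)
            if new != table[path]:
                table[path] = new
                changed = True
        return changed


# A maximally unknown entry: most general gender, no attested case yet.
entry = LexicalEntry(
    specializable={"gender": GENDER},
    generalizable={"case": frozenset()},
)
entry.update({"gender": frozenset({"masc", "neut"}), "case": frozenset({"dat"})})
entry.update({"gender": frozenset({"masc"}), "case": frozenset({"acc"})})
```

In this sketch a specializable value starts maximally general and shrinks with each informative context, while a generalizable value starts empty and grows. The `update` method reports a change only when the context contributes something new, mirroring the condition stated in the abstract.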
Similar resources
A Statistical Approach towards Unknown Word Type Prediction for Deep Grammars
This paper presents a statistical approach to unknown word type prediction for a deep HPSG grammar. Our motivation is to enhance robustness in deep processing. With a predictor that assigns lexical types to unknown words according to their context, new lexical entries can be generated on the fly. The predictor is a maximum-entropy classifier trained on an HPSG treebank. By exploring vario...
Fast and Scalable HPSG Parsing
We investigated the efficacy of beam search parsing and deep parsing techniques in probabilistic HPSG parsing. We first tested the beam thresholding and iterative parsing. Next, we tested three techniques originally developed for deep parsing: quick check, large constituent inhibition, and hybrid parsing with a CFG chunk parser. The quick check, iterative parsing and hybrid parsing greatly cont...
The HPSG paradigm and the issue of locality
– from a linguistic perspective
– from a formal perspective
• Locality of grammatical relations in HPSG
3 HPSG grammars from a linguistic perspective
From a linguistic perspective, an HPSG grammar consists of
• a lexicon licensing basic words
• lexical rules licensing derived words
• immediate dominance (id) schemata licensing constituent structure
• linear precedence (lp) statements constraini...
Adaptability of Lexical Acquisition for Large-scale Grammars
In this paper, we demonstrate the portability of the lexical acquisition (LA) method proposed in Cholakov and van Noord (2010a). Here, LA refers to the acquisition of linguistic descriptions for words which are not listed in the lexicon of a given computational grammar, i.e., words which are unknown to this grammar. The method we discuss was originally developed for the Dutch Alpino system, and...
Off-line Constraint Propagation for Efficient HPSG Processing
We investigate the use of a technique developed in the constraint programming community called constraint propagation to automatically make an HPSG theory more specific at those places where linguistically motivated underspecification would lead to inefficient processing. We discuss two concrete HPSG examples showing how off-line constraint propagation helps improve processing efficiency.
Publication date: 1998